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MATCHING PROCESS 

This invention relates to a matching process for use in a contention resolution scheme for 
a multi-stage switch arrangement particularly but not exclusively for a cell, packet or circuit . 
5 switch or network. In particular, but not exclusively, the Invention relates to a scalable 
hierarchical matohing algorithm, particularly but not exclusively suitable for matching 
asymmetric request matrices. 

The term "matching" refers to the matching of requests for transmitting input-queued 
10 traffic to available outputs when scheduling cells or packete for transmission across a 
switch. The term "input-queued traffic" refers to traffic buffered at the input ports of a 
switch prior to switching acnDss the switch fabric. . An overall scheduling operation 
comprises the matching process described herein and a time-slot assignment process 
which determines actual channel availability for transmission across the switch fabric. 
15 One example of a time-slot assignment process is described by the inventors In their 
United Kingdom Patent Application No. GB-A-0322763.4, the contents of which are 
hereby incorporated into the description by reference. 

The term switch is used herein to refer to switches and/or routers and/or networks which 
20 fonvard data towards their destination, such as are used in communication networi<s, for 
example, the IntemeL The present invention also relates to the matching of circuit- 
switched service requests, such as connections and information rates, for switching 
across a switch fabric. This description is written in terms of cell and packet switches, but 
the principles also apply to circuit switches (for example, in the context that the matching 
.25 process seeks to grant sen/ice requests without contention, and the services requests can 
equivalently be requests for bandwidth etc. in a circuit switch). 

As communication networks, particulariy the intemet, evolve, faster and more efficient 
switches are needed, for example, switches capable of exceeding Terabit per second 
30 throughputs. There is therefore a demand for faster and more efficient high-throughput 
schedulers to schedule traffic through such switches, and therefore a demand to generate 
computationally faster and more efficient scheduling algorithms. 

Packet switching Involves tiie svwtching of data in packets through a data networi<. An 
35 arriving packet could be variable or fixed length, unicast or multicast. A packet is 
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multicast if it has more tlian one destination port Variable length and/or multicast packets 
can be transfen^ed to fixed-length unicast packets by methods well known in the art, and 
the tenn "ceir is used to refer to a fixed-length unicast data packet. >^ cell consists of the 
header and payload, and each cell has a unique identifier, a sequence number and the 
5 destination address (the destination output port number) of the cell which is encapsulated 
in the header. 

Input Queued Switchino Schemes 

10 Rgure 1 of the accompanying drawings shows a general model of an N x N switch where 
of the N input and N output ports, only three input and three output porte are shown for 
convenience and clarity. Accordingly in Rgure 1, switch 1 is shown having input ports 2a, 
2b,..., 2n and output ports 3a,3b,...,3n. Each input port 2a, 2b,..., 2n is provided with one 
(or more) Input buffers 4a,4b,...,4n respectively, the buffer(s) for each port being controlled 

15 by one or more buffer controllers 5a,5b,...,5n respectively. In a virtual output queued input 
queued switch, a number of virtual output queues {VOQ) are provided: each input port 
having a VOQ for each destination port (i.e., each input port in the N x N switch has N 
VOQs) and pointers are used to point to the addresses of the cells In each VOQ. VOQs. 
arp described in more detail later herein below. 

20 

A scheduler 6 is used to schedule the transmission of the cells arriving along the Input 

links 8a,8b,...,8n to their destination links 9a, 9b 9n. The scheduler 6 determines 

which cells from which VOQs traverse the switch fabric 7 during a switch cycle. The 
function of the scheduler can be distributed between the Input and output porte, such that 
25 each input and output port has an arbiter associated with it, either physically or logically. 
Generally, a scheduler operates to switch one cell per timeslot, i.e., one cell is switched 
per period of lime for a cell to be transmitted across the switch fabric 7. However, frame- 
based schedulers are known in the art in which a plurality of cells are switched over a 
plurality of timeslots. The operation of the switch is then synchronised over a plurality of 
fixed-size timeslots, which constitute a frame. 

In Rgure 1 , the switching fabric 7 comprises a suitable interconnecting network in the form 
of single-stage or multiple-stage space and/or wavelength switches. Some or all of the 
wavelength switches can be implemented as wavelength-switched networks. Rgure 1 for 
clarity only shows the possible internal Input-output links 10a,10b,...,10n for input port 2a, 
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but each input port 2a,2b,..,2n will have possible internal input-output links connected 

through the switch fabric 7 towards the appropriate destination output ports 3a,3b 3n. 

Each Intenjal Input-output link within the switching fabric 7 Is assumed to be capable of 
transmitting data at a speed of one cell per timeslot. It Is not necessary for an Input link 

5 8a,8b,...,8n (each of whteh connects to their respective Input port 2a,2b 2n) to the 

switch to operate at the same speed as an Internal Input-output link (e.g., input-output link 
10a,10b.....10n) virithin the switoh fabric 7. 

During each timeslot the Interconnecting networic of the switch fabric 7 is capable of being 
1 0 configured by the scheduler to simultaneously set up a set of transmission paths between 
any pair of input ports 2a^b.....2n and output ports 3a.3b....,3n provided no more than a 
predetermined upper limit of cells are transmitted by an Input port 2a.2b...,2n or received 
by an output port 3a,3b 3n during each frame. 

15 If the packet switch 1 Is to process variable sized packets, or non-unteast packets, the 
appropriate conversion steps into fixed sized packets (or cells). Is assumed to have 
already occun^ and thus these components are not shown In Rgure 1. In Rgure 1, each 
input link 2a,2b...,2n provides fixed sized packets (I.e. ceils) to cell Input buffer 4a,4b,..,4n 
and buffer controllers 5a,5b,...5n respectively for header translatfon, addressing, and 

20 management functions which are perfonned on the Incoming ceils. The scheduler 6 
processes the fixed-sized ceils so that the switch fabric 7 operates In a synchronous 
manner. 

The role of the scheduler 6 thus comprises matching each cell rBskfing in an input buffer 
25 to its destination output port. Thus the scheduler 6 can be considered to be repeatedly 
solving a bipartite matching problem for each timeslot, in the manner described by 
Anderson et ai, "High-speed switch scheduling for iocal-area-networics", ACM 
Transactions on Computer Systems, vol. 11, no. 4, pp 319-352. By providing an 
appropriate match, e.g. matching a maximum number of Input ports to output ports, or 
30 matching a maximum weighted number of input porte to output ports, in each switch cycle, 
the scheduler 6 is considered as treating the queued traffic in a useful and fair manner, 
depending on the nature of the traffic matrix. 

In general, fixed-size packets (cells) are assumed to be switched In the switch fabric 7 to 
35 support high speed operation of the switch 1. As was mentioned above. If variable length 
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packets are to be supported in the network, such packets are segmented and/or padded 
Into fixed sized cells upon anival, switched through the fabric of the switch, and 
reassembled Into packets before departure. 

5 Output contention can arise when cells destined for the same output port anive 
simultaneously at the switch 1 at more than one Input port. To suppress cell losses, such 
cells are buffered by the switch 1 until they can be transfened to their destination output 
ports. The operation of the matching algorithm can potentially cause input contention, 
where more than one cell could be scheduled for transmission across the switch fabric 
10 from the same Input port This must be avoided by the matching algorithm. Whilst switch 1 
supports a virtual output queuing (VOQ) scheme for the Input queuing (IQ), a number of 
altematlve queuing strategies are also known In the art, output queuing (OQ), shared 
queuing (SQ), and combined input-output queuing (CiOQ). 

15 In a conventional Input queued switch, basic input queuing (IQ) avoids using high- 
bandwidth buffers by providing a buffer for each input port for incoming packets. With this 
queuing scheme, the bandwidth demand of each input buffer is reduced to at least one 
write operation and one read operation per time slot. With a properiy designed scheduling 
algorithm, a set of input-output contention free cells Is selected from the buffered ceils for 

20 transmission to their destination output ports, from time slot to time slot When the overall 
scheduling operation is applied to a number of timeslots simultaneously in a frame of 
timeslots, scheduling comprises two sub-processes, matching and timeslot assignment. 
These processes will be described in more detail later herein bek>w. 

25 Whilst output queued (OQ) switches and shared queue (SQ) switches can generally 
achieve better perfomiance than input queued switches and combined input-output 
queued switches, this is only so for a finite size of N X N switch. As the number of Input 
and output ports of the switch increases, the bandwidth demand of the OQ or SQ buffer 
grows lineariy as the aggregated Inpiit-output link rate increases. AcconJingly. it is known 

30 in the art that OQ and SQ switches generally do not scale very well. As the switch 
architecture of an input queued (IQ) switch with FIFO queuing (and similariy a Combined 
Input-Output Queued (CIOQ) switch) is much simpler than that of OQ and SQ switches. 
IQ and CIOQ switches generally scale better than OQ and SQ switches as each input 
buffer maintains a single FIFO for all incoming cells. However, despite the simplicity in 

35 the switch architecture of an IQ switch with FIFO queuing, the maximum throughput is 
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relatively low for uncorrelated (Bernoulli) traffic with destination outputs distributed 
unlfomily (for example around 50%-60% or so), and the throughput is worse for con-elated 
(on/crff bursty) traffic. This is a result of the HOL biocldng problem, in which a cell queuing 
behind the HOL cell of a FIFO cannot participate in scheduling, even if both Its residing 
5 input and destination output are idle. 

By supporting Virtual Output Queuing (VOQ) in the input ports of an IQ switch, HOL 
blocking can be removed. The Virtual Output Queue (VOQ) scheme (also Imown as the 
multiple input queuing scheme) is described in The iSLIP Scheduling Algorithm for input- 
10 Queued Switches^ by N. McKeown, IEEE/ACM Trans. Networidng, Vol. 7, No. 2. pp. 188- 
200 (April 1999), and United States Patent Number US 5500858, the contents of which 
are hereby incorporated by reference). 

Conventionally, in an input-buffered VOQ switch, a fixed-size padcet (or ceil) is sent from 
15 any input to any output, provided that, in a given timeslot, no more than one cell is sent 
from the same Input, and no more than one cell Is received by the same output Each 
input port has N VOQs, one for each of N output ports. The HOL cell in each VOQ can be 
selected for transmission across the switch in each timeslot Accordingly, In each 
timeslot a scheduler has to detennine one set of matching. I.e., for each of the output 
20 ports, the scheduler has to match one of the con-esponding VOQs with the output port. 

Rgure 2 of the accompanying drawing shows schematically a 4 x 4 VOQ IQ switch 20. 
Switch 20 has four input ports #a1 . #a2, #a3. and #a4 and four output ports #b1 , #b2, #b3, 
and #b4 which are capable of being interconnected by an internal switch fabric 21 . Each 
25 Input port #a1, #a2, #a3, and #a4 has four VOQs, one VOQ for each of the destination 
output ports #b1, #b2, #b3, and #b4. In Rgure 2, the VOQs are denoted VOQ#ai#bj 
where i, j ranges from 1 to 4 respectively. 

It is known in the art that the implementation of a VOQ scheme can enable up to 100% 
30 throughput to be achieved. Scheduling algorithms such as maximum weight matching 
algorithms have a high level of complexity, e.g. the number of calculations to be 
performed per single time-slot matching is O(N^). Currently, the amount of time it would 
take to perfomfi such an algorithm to calculate the matching is impractical under high- 
speed environments where the duration of a time slot (the time taken to run a switehing 
35 cycle i.e., to transport a cell from an input port to its destination port across the switch 
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fabric) Is very small. 

Other scheduling schemes known in the art Include a three-stage switch scheduling 
scenario in which a large packet switch is decomposed Into a number of smaller switch^ 
having fewer ports (see Joseph Y Hui In "Switching and Traffic Theory for Integrated 
S Broadband Networks^, Kluwer Academic Publishers, 1990, Chapt 5, and J S Turner, 
"WDM Burst Switching for Petabit Data Networks", OFC 2000 presentation). However, in 
these known schemes each switch has Its own buffering which requires scheduling to be 
perfomfied independently for each switch. No.scheduling occurs between the switches, 
which leads to at least two limitations. Firstly, heuristic mies are required (e.g. load 
10 spreading between switches) to enable a reasonable switch perfomiance to be 
maintained. Secondly, packets may arrive at their destination out of sequence. 

Contentton Resolutton in an Incut Queued \/lrtual Output Queued Svritch 

15 Consider the N X N switch shown In Rgure 1. In Rgure 1, the switch 1 has N input 
queues in each input port Accordingly, there are N* VOQs in total. However, switch 1 
has only N output ports to transfer at most N ceils to in a given timeslot. Thus contention 
occure amongst the VOQs. 

20 Several methods are known in the art which seek to resolve this contention issue. One 
known technique to reduce the computing time complexity is to use a heuristic maximal- 
sized matching algoritfim such as tfie ISLIP scheduling algorittim by N.iy/teKeown. Uke 
many matching algorithms, ISLIP comprises 3 phases known as tiie request, grant, and 
accept phases. In the request phase, each of ttie N^ Input queues sends a request to the 

25 output ports. In ttie grant phase, each of ttie output ports grants one request among Its 
own receiving requests using a suitable selection operation and notifies the result of grant 
to each of the input ports. An input port may receive several grante from each output port 
at the same time so that in the accept phase each of the Input ports accepts one grant 
amongst Its own receiving grants using a suitable selection process. Several request- 

30 grant-accept cycles are iteratively performed. 

With such a three-phase matching approach, a problem which needs to be addressed to 
optimise the matching process is how to ensure that tfie selection processes fairly and 
quickly select and grant one request from a plurality of requests which could be granted 
35 (and equivalentiy accept one grant from a plurality of grants which could be accepted). In 
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ISLIP, this selection process is acliieved using a particular set of pointer rules. iSUP can 
be faster than alternative schemes which use random selection. Unfortunately, in the 
iSLIP algorithm, as the number of input ports and output ports Increases, the''number of 
requests and grants which must be selected between in the grant and accept phases 
5 within one time slot Increases. Although the iSLIP algorithm has less computing 
complexity than a maximum matching algorithm it has a limitation: ISLIP requires the 
maximal matching to be completed within one timeslot. Again, as the switch size 
increases or if a switch has very high port speeds (either because the matching time itself 
increases beyond the time for one time slot, or because the timeslot itself has a shorter 
10 duration) ISLIP is no longer suitable. 

Several other matching schemes have been devised in the art which seek to provide 
greater scalability and so support faster switch cycles. For example, pipeline-based 
scheduling algorithms such as the Round-Robin Greedy Scheduling (RRGS) allows each 

15 input to perfbnn only one round-robin arbitration within a given time slot to select one 
VOQ. Fbr a switch with N inputs, N. input round-robin operations (to select a cell to be 
transmitted at a given time slot T) are allocated to the different previous N time slots,{T-N, 

"•"-(N+l) T-1) in a simple cyclic manner to avoid output contention. A drawback of this 

scheme is when traffic is not balanced across the input of the switch, some inputs can 

20 unfairly send more cells than others. Whilst other schemes are known in the art to 
guarantee pre-reserved bandwidth, for example, the weighted RRGS scheme, this has a 
drawback in that it does not guarantee fairness for best-effort traffic and a further draw 
back in that as every even number of timeslot cycles an idle timeslot is produced resulting 
in the switch capacity not being fully used. 

25 

Oven/iew of Frame-Based Schedulinn 

As was discussed briefly in the introduction, the overall scheduling operation comprises 
two sub-processes, a matching process and a timeslot assignment process. A similar 
30 division exists where a frame-based scheduling approach is implemented. 

The frame based approach comprises two steps for each frame. The first step involves a 
matching process in which a number of cells queued at the inputs are accepted for 
transmission to outputs in a non-contentious manner. The second step involves a time- 
35 slot assignment process in which the successfully matched cells are scheduled for 
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transmission in the different time slote of tlie frame. Wis time-siot assignment step can 
be considered to be equivalent to sclieduling a set of non-conflicting requests in a time- 
frame, wiiich can be performed using Icnown patfi-searcliing algorisms such as those 
used to route circuits in a Cios interconnection networic, for example, see WO01/67783 
5 "Switching ControF and also WO01/6780y Frame Based Algorithms for Switch Contror, 
and WO01/67802 "Pacl<et Switching", ail three of which are hereby incorporated by 
reference. 

At the beginning of a frame, the total numt>er of packet requests from each Input port to 
10 each output port as a pair is collected into an N x N Request Matrix R (the request phase 
of the process). Each element r(ij) of this matrix is an integer showing the total amount 
of stored packets in the VOQ between input port i and output port j. 

The matching process populates a symmetric N x N Accepted-Requests matrix A. Each 
15 element. a(i,j) of A represents the total number of accepted switching requeste from the 
VOQ between input port i and output port j, i.e., requests that have been accepted to be 
switched during the following time period (frame) available for transfening one or more 
cells between an Input port and an output port using one or more timeslots. Each 
accepted request a(i,J) of A is constrained by the overall capacity of the switch input and 
20 output ports ''P , i.e., the sum of elements in each row and each column must not exceed 
the frame size F; .i.e. the number of time slots or cells in the frame. Various matching 
algorithms are known In the art to try to optimise the use of the available switch capacity. 
All of these matching algorithms seek, in each time period consisting of one or more time 
slots, to determine a non-conflicting match between the input ports and the output ports of 
25 a switch fabric of an N x N symmetric request matrix. 

For example, where F = 1 (and for unicast traffic) a matching process will seek to link 
each input port to at most one output port and each output port is linked to at most one 
input port. A complete matching of the input ports to the output ports in one timeslot is 

30 then equivalent to determining the appropriate permutation of the input ports. However, 
as a complete matching cannot always be achieved, ma)dmal matching algorithms seek to 
optimise the selection of which ceils should be transmitted from input to output per 
timeslot. This optimisation depends on a number of factors selected according to the 
particular emtx^diment of the matching algorithm implemented and can depend, for 

35 example, on the length of queue and/or how long the cell at the head of each queue has 



wo 2005/032167 



9 



PCT/GB2004/004159 



been queued for. 

Frame-based matching where F ^ 1 has already been described in the art, for example, In 
"Frame-based matching algorithms for input-queued switches^ by Andrea Bianco, Mirko 
5 Franceschinis, Stefano Ghisoifi, Alan Michael Hill, Emilio Leonard!, Fabio Nerl, Rod Webb, 
HPSR 2002, Worl<shop on High Perfomiance Switching and Routing, Kobe, Japan, 26-29 
May 2002, the text of which Is incorporated herein by reference. Bianco et al describe a 
frame-based switch contention resolution scheme which can be considered to comprise 
two-steps for each frame (a frame being considered to be a set of one or more time-slots). 

10 

Consider a frame whose length is F(l.e., whose transmission could occupy F consecutive 
timeslots). A set of cells selected to be transmitted in the timeslots belonging to the next 
frame is selected at the end of the current frame, i.e., switch control occurs on a multi- 
time-siot basis at the edge of each frame boundary. The set of cells selected for 

15 transmission is temned the F-match, and this needs to always comply with the Joint criteria 
that i) the total number of selected cells from each input port cannot be larger than F and 
li) the total number of selected cells which are to be transmitted to each output port cannot 
exceed F. Equivalently, if aij Is the number of accepted cells from input i to output j, the 
constraints are: 

20 Eqn. 1 Y,^J ^ i and 

Eqn.2 

J 

The selection of the cells forming the set to be transmitted in the next frame is made using 
an Fmatching algorithm (and where F=1, the Fmatching is equivalent to a conventional 
time-slot by time-slot approach). Once the Fmatch has been obtained, cell transmissions 
25 need to be assigned to different timeslots of the frame, i.e., a set of at least F switch 
permutations capable of transferring all cells belonging to the F-match in a non-conflicting 
manner. 

The frame-based matching scheme Bianco et al describe is implemented using a 
30 request/grant/accept scheme in a manner similar to iSLIP. ISLIP utilises a rotating priority 
scheme in which the selection of requests to be granted (at outputs) and of grants to be 
accepted (at inputs) is implemented using two sets of N pointers, one for each input and 
one for each output An output (input) pointer points to the input (output) port to which 
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highest priority Is given in issuing grants (acceptances). Accordingly, grants and 
acceptances are given to the first busy queue in a cyclic order starting from the current 
pointer position. Input and output pointers are up-dated after each matching to the first 
input (output) following the one which has been accepted. 

5 

Bianco et al also describe a "No Over Booking^ (NOB) matching algorithm consisting of a 
generalisation of the known iSLIP algorithm by McKeown et al, but one or more iterations 
(i.e.,. a generalisation of 1-SLIP) and associated pointer update rules. The NOB algorifiim 
output booking and input booking steps are described in detail in Bianco et al, and are 

10 incorporated her^n by reference. Briefly, the NOB algorithm steps tiirough an output 
booking phase followed by an input booking phase, similariy to iSLIP. In the output 
booking phase, each virtual output queue (VOQ) requests a number of time-slots in the 
appropriate output frame, and as a reply eadi output port issues up to F grants distributed 
amongst the N VOQs destined for that output. The total number of requests is 

15 represented by a request matrbc R, whose elements r^ represent the total number of time 
slots requested by input port / for output port J. 

In general the length q^ of each VOQ (i*e. the number of cells queued) can be greater 
than the frame lengtti F. The number of actual requeste made by each VOQ from the 
20 request matrix R is up to qy when qy < Fbut If qy 2t Ftiien up to F (as no more tiian F 
time slots can be requested at any one time). The request matrix R is distinguished from 
the normalisation phase matrix to be discussed below which uses as its input ''requeste" 
the actual queue lengths, but which does not reduce the number of requests in each VOQ 
to the frame lengtti F prior to the first stage of matching. 

25 

During the output booking phase, each output port operates simultaneously, hence output 
porte operate Independentiy, so that there is no guarantee that tiie total number of grants 
received by VOQs at one input port will not exceed the capacity of the input frame. To 
remedy this, each input port accepts up to F of the grants recfeived at that port. Each 
30 acceptance received by a VOQ at one input port gives that port the right to transmit one 
ceil in the next frame. 

The NOB frame matching algorithm Bianco et al describe is in some sense therefore a 
hybrid between a maximum weight matching (MWM) (which assigns a weight to the cells 
35 at ttie head of each VOQ, and which optimises the cumulative weight of cells which are 
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successfully matched) and a maximum size matching (MSM) (which addresses optimising 
on the t}asis of the overall number of cells which are successfully matched being a 
majdmum}.. In each phase, the final steps fn the above algorithms begin on an Initial VOQ ^ 
which Is indicated by a pointer. Accordingly, each output port maintains a pointer showing 
5 which input port should be given priority for its additional grants in the final output booking 
step, and each input port keeps a pointer showing Which output port has priority in its final 
input booking step. Several schemes are known In the art for updating the pointera so that 
a level of fairness is maintained. 

10 Prior art such as United States Patent number US 6.487,213 entitied "Methods and 
Apparatus for Fairiy Arbitrating Contention for an Output Port" by Chao describe 
hierarchical arbitration methods in which requests are aggregated together, but arbitration 
is performed independentty for different output ports, i^atching is not perfomned globally 
between all input and output ports to solve contention across tiie entire switch fabric, nor 

15 does Chao address tiie issue of resolving contention in an input-svwtch, as Chao 
addresses the issue where both input and output queuing are provided. 

Consider when the original request matrix IRo Is transformed to a. normalised request 
matrix R«OTn by transfomnation factor d. The original request matiix Rq could be, for 

20 example, the matrix of VCX3 queue lengttis (i.e. numbers of requeste or cells queued in 
each VOQ) or a measure of tiie requested traffic rates. An example of a transfonnation 
factor d is described later. Rr = Ro - Rnom. is tiie request matrix of remaining requeste 
given by ttie original request matiix Ro and R„eim is the partially populated Accepted- 
Requests matrix A from the first stage of tiie matching. Rr is used to fill up as much of tiie 

25 remaining capacity of the frame as possible, by running another matching algorithm 
(which could be tiie same as the first or different) in a second stage to populate a second 
accepted requests matrix A2 derived from the matiix of remaining requests R^ The final 
matiix of accepted requests A = Rnorm + A2, i.e., A is just tfie sum of tiie two matrices found 
during the two stages. Consider tiie following example request matrix 

^3 4 2 0"^ 

.= o n 5 0 10 

30 Eqn. 3 it, = 

^ 8 5 13 

,2 0 2 6j 

This is ti-ansfomied by a tiransfprmati'on factor d= F/max (F.mvai), where F = 8 and mval is 
defined to be the maximum sum of any one column or row in Ro, i.e.. 
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Eqn. 4 mval = max(2^/jj i^j^4,^nj i^i^4) 



Here d = 8/18 and thus Ro = L4R/9J, where the elements of Rq are the integer parts of the 
resulting numbers. For this example, then 





'I 


1 


0 


6) 




2 


0 


0 


0 




3 


2 


0 


1 




.0 


0 


6 


2j 



5 The remaining request matrix = 

^2 3 2 0^ 



Eqn, 6 R, = 



3 0 10 
5 3 12 
1^2 0 2 4j 

A second matching procedure is then performed on the remaining request matrix Rr which 
produces another Accepted Requests matrix Aa. The total Accepted Requests Matrix A 
per frame is then given by the sum of Rtoito and A2, i.e., A = Rnorm + A2. 

Therefore the operation of an existing, example single-level matching algorithm known, in 
the art can be summarised as follows. 



Normalisation Staoe 

15 Rrst the elements in the request matrix are transformed by normalising them according to 
the highest queue value and the total number of timeslots available in a frame, so that in 
the normalisation phase: 



20 



25 



Eqn. 7 [r(i,j)l => [rnonnOj)], rr(i j)l 

"No Overboolcina" Stage 

This comprises an output booldng phase followed by an input booking phase. In 
the output booking phase, a granted-request matrix is formed from the matrix of remaining 
requests, i.e., in the output port booking phase the grants are derived as follows: 

Eqn. 8 [rr(i j)] =:> [g(id)l : ^ 9(M)^F-[5] Qnonn(ij)l, 
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Where Qromi "s the number of normalfeed requests queued for a particular output port In 
the Input booking phase, an accepted grant matrix is generated from the matrix of granted 
requests, I.e., the accep^d grants populate this matrix according to: 

Eqn.9 . Ig(i.j))=>la(i.j)l; £ a(i,j):sF 

) 

5 

Time Slot ^ianment 

The second process off the scheduling algorithm is the Time Slot Ass^nment it attempte 
to compute the set of switch (or networic) configurations for each time slot, such that the 
matrix of accepted requests can be transferred from the Input ports to the output ports 
10 across the switch without blocking any packet, i.e., to ensure there is a free time > slot 
available for each padcet from its Input port to its desired output port This Is not always 
possible, depending on the Time Slot Assignment algorithm and the number of time slots 
(switch pemiutations) available. Some or even all of this set of switch pennutations may 
be tfie same. As an example, consider the request acceptance matrix 

15 Eqn. 10 A = {a{iJ),\^ihj)^^]= 

A pos^ble set of 8 switch permutattons to send these numbers of cells, or padcete from 
input ports to the output ports (elements in the table) across the switch is shown In the 
following table 



Time slot > 
Input Port ▼ 


1 


2 


3 


4 


5 


6 


7 


8 


1 


1 


1 


2 


2 


2 


2 


3 


3 


2 


3 


4 


1 


3 


1 


3 


2 


1 


3 


2 


2 


4 


4 


3 


1 


1 


2 


4 


4 


3 


3 


1 


4 


4 


4 


4 



20 

If we call this set of permutations Pn, where n is the time slot within the frame (1 ^ n ^ F ), 
then it would correspond to the following sequence of penmutation matrices 



2 4 2 0 

3 13 1 
2 3 12 
10 2 5 
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1 


0 


0 0 


0 


0 


1 0 


0 


1 


^ 0 


0 


0 


0 1 



ft- 



10 0 0 
0 0 0 1 
0 10 0 
0 0 10 



0 0 10 
10 0 0 
0 10 0 
0 0 0 1 



Eqn, 11 

Several algorithms are known In the art suitable for implementing time slot assignment, 
5 with different blocldng characteristics, dependent also on the numt)er of time slots (switch 
permutations) available. 



Referring now to Figure 2 of the accompanying drawings, an input queued cell (or packet) 
switch arrangement Is shown. In Rgure 2, each of the four input ports #a1..#a4 has four 

1 0 first-in-first-out virtual output queues (VOQs), designated as VCX2a1 b1 ...VOQa4b4. Each 
of the VOQs associated with the same input port stores cells destined for a different 
output port, the total length of the queue from input port i to output port j (the number of 

cells to be transmitted in the next time frame) being indicated by qij where i, J 1 , 2, N. 

Cells which queue up in ttie VOQs generate requests which can be presented by a queue 

15 matrix as shown below (and in Rgure 2). In general, the number of cells queued in a VOQ 
can exceed the frame size F. 



20 



As an example of a conventional single-level matching an example queue matrix [Q(i,j)] 
such as is shown in Figure 2, for a 4x4 switch having a frame duration F of F=4 time slots 
Is matched below. The VOQ lengths, i.e. the number of cells or packets waiting in each 
VOQ, are assumed to have a '^wers of two" distribution, i.e. 



Eqn. 12 



[Q(lj)l = 



12 4 8 
2 4 8 1 
4 8 12 
8 12 4 



A conventional single-level frame-based matching process in which a number of stages 
25 are present in the matching process will now be described. 

Single-Level Matching Nomialisation Stage 

The normalisation algorithm first finds the row (input port) or column (output port) with the 
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largest sum of requests (or queue lengths), maxval. Every request (or queue length) Is 
then normalised to this maximum value, firstly by being multiplied by the ratio c=F/maxval 
if ma)^al Is >F (or greater than the maximum number elf grants or acceptances allowed) 
and c=1 othenAfise, and secondly by taking the Integer part of the resulting number. 

For [Q(ij)] In Eqn.12 maxval = 15, which is larger than F (=4). Hence the nomnalised 
queue matrix becomes 



Eqn. 13 



[Qrionn(ij)] = 



0 0 12 
0 12 0 
12 0 0 
2 0 0 1 



All of these ceils or packets are assumed already to be granted by the output ports and 
10 accepted by the input ports. The request matrix presented to the second "no overbooking" 
stage is the difference between the original queue matrix and the normalised queue 
matrix, i.e. the remaining requests 



Eqn. 14. 



[r(lj)]ou!put =[Q(ij)]-[Qnonn(i.j)] 



15 



20 



These requeste are used by the "Vio overiDooking" stage to fill up the remaining available 
time slots in the frame as much as possible. 

Sinole-Level Matching ^A/o Overbooking" Stage Output Booking Phase . 



The number of requests in effect already granted by the output ports in the nomialisation 
stage is 

25 Eqn. 15 Q„onn(i j)] = [3 3 3 3] 



The remaining number of grants available in each output port is therefore 
Eqn. 16 [f f f fJ-[3 3 3 3]=[i 1 i 1] 
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Output booking operates simultaneously In each output port in the three following steps: 

1 . if the totel number of requests received by the port is less than the remaining 
number of grants available, then all requests to the port are granted. 
5 2. if the number of VOQs with an unsatisfied request destined for the port is less 

than or equal to the remaining number of grants, then the VOQs receive one 
grant each. This step Is repeated as many times as possible. 
3. taking the VOQs In turn, starting from the one indicated by a pointer, each 
VOQ with an unsatisfied request receives one grant until the total number of 
0 grants given by the port reaches the remaining number of grants In Eqn.16. 

Step 1 does not apply in this case, because the total number of inquests [J] r(i,j)]output is 

t 

12 to all ports (from Eqn.14). Step 2 does not apply either, because there aiB 4 VOQs 
with unsatisfied requests destined for every output port and only one available grant each. 
1 5 Step 3 applies in this case. 

Single-Leve l Matching Pointer Update Rules 

A detennlnistlc NOB25 pointer up-date rule such as that described by Bianco et al 
20 Initialises the pointers so that each output port gives priority to a different input port (and 
vice versa) and the pointer advances by 1 each frame, such that on cycle k of the 
algorithm (I.e. for the k-th frame, starting at k=0), port P gives priority to port p, where 



25 



Eqn.17 P=1+I(LN-P + k)^uJ 



Hence In the first frame k=0, with LN=4 In this example, output port 1 points to input port 
4, 2 points to 3, 3 points to 2 and 4 points to 1, i.e. the pointers point to VOQ requests 
r(4,1), r(3,2), r(2,3) and r{1,4) in Eqn.14. All of these VOQs have 6 requests, and because 
Eqn.16 allows only 1 more available grant for each output port, each of these four VOQs 
30 will be granted one more request; i.e. the additional output booking grants [g(ij)l are given 
by 



Eqn.18 [g(|,J)] = 



0 0 0 1 
0 0 10 
0 10 0 
10 0 0 
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SInale-Level Matching ^A/o Overbooking^ Staae Input Booking Phase 

The number of requests In effect already accepted by the Input ports In the normalisation 
phase is 



5 Eqn. 19 



Qnonn(lj)l = 

i 

The remaining number of additional acceptances available in each Input port is therefore 



Eqn. 20 



Qnonn(i.j)l = 



4 


3 




1 


4 


3 




1 


4 


3 




1 


4 


3 




1 



The request matrix for this Input booldng phase Is tfie additional output bootdng grants 
10 matrix [g(lj)] (Eqn.18). Step 2 of the "no overtxsoldng' algorithm applies, so ail of the 
additional output booking grants are accepted, i.e. the additional input booking 
acceptances [aaddwonaiOfj)] are given by 

0 0 0 r 

Eqn. 21 laaddMonalOJ)! = Olio 

1 0 0 0, 

The final acceptance matrix Is the sum of the acceptances from the Initial normalisation 
1 5 (Eqn. 1 3) plus these additional acceptances from the "no overbooking" algorithm (Eqn.21 ), 



Eqn. 22 



[a(ij)] = 



(" J)] + [aadcfrtloiial(ij)] 



0 0 1 2 0 0 0 1 

0 12 0^0010 

1 2 0 0 0 1 0 0 

2 0 0 ij [l 0 0 0 
"0013 

_ 0 1 3 0 
13 0 0 

3 0 .0 1 

All Input and output ports fill all F=4 time slots In this first frame. A full set of 16 cell or 
packet requests has been accepted In the first cycle or frame. They are taken from 8 of 
the longest VOQs In Eqn.12. 



25 



A matching algorithm may not completely fill up the matrix of Accepted-Requests from the 
matrix R (i.e., ensure A -R = a null matrix, indicating all requests have been granted). This 
is partly because A is constrained by the fact that the sum of elements a(l j) in each row 
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and column cannot exceed the number of timeslots in a frame length, F. Referring now 
back to the example request matrix given in Eqn.3 and with frame duration F = 8 

• timeslots, then both 2l^i.j^ ^ ^ jand^^i;^ 8 V i and by the constraint 

1=1 j.i 

that the total of the sums of elements in each column and each row is limited to the total 
5 number of available timeslots N X "P, i.e., 

SS^/j^ N X F, 

Where N = 4 and F = 8 in this example. This is the maximum possible numbfer of 
acceptances for the frame. But the matching algorithm may not be able to achieve this 
maximum number. 

10 Thus, as a more specific example, consider where the matching algorithm populates a 
matrix of Accepted-Requests as follows: 

^2 4 2 0^ 
3 0 10 

A = 

2 3 12 
^1 0 2 5^ 

4 4 

Here F= 8 and N x F = 32, however, = 28 meaning that the maximum switch 

.=1 i=i 

capacity of 32 requests per time period of 8 time slots has not been utilised. 

15 

The above example illustrates clearty one limitation of such a known frame-based 
matching process, in that the switch capacity may not be utilised fully, resulting in some 
redundant switch capacity in any given frame. 

20 The present invention seeks to obviate and/or mitigate some of the problems related to 
optimising matching algorithms so that their computational complexity is further reduced, 
yet which can more efficiently utilise the switch capacity. Ideally the computational 
complexity is reduced to a level which is suitable for the high-speed switches which are 
currently being developed for future use. The invention provides a frame based matching 

25 algorithm which seeks to obviate and/or mitigate some of the problems known in the art 
related to optimising matching algorithms by seeking to further reduce the number of 
computing steps in a frame-based matching process from 0(LN) to 0(L) or 0(N) for the 
frame. 
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A first aspect of the invention pFo>^des a mateliing method for a number N of first 
elemente, each f ir^ element ananged to at least provide ingress to a switoh arrangement, 
each of the first N elements comprising a number Li of first sub-elements, the switch 
5 an:angement having a number MLg off second sub-elements ananged to at least provide 
egress from said switch an-angement, and wherein each of the firat Li sub-elements is 
capable of conveying a service request for at least one of said second sub-elements MLg, 
wherein the method comprises: firstly, for every one of the N first elen^ents, aggregating 
service requests from all Li first sub-elements to each of the MLz second sub-elements, 
10 and secondly, resolving contention for said serAce requests from all N first elements to 
one or more of said second MU sub-elements, and thirdly, for each first element, 
resolving contention between the Li sub-elements and said second sub-elements. 

TTie step of resoMng contention between ttie Li sub-elements and said second MLz sub- 
15 elements may be perfomned in parallel for each said first element. 

The MLa second sub-elements of the switch anwigement may be provided as a number M 
of second elements, each of said M second elements being associated with a number U 
of second sub-elements. 

20 

Each sub-element may be capable of generating at least one said sen/ice request. 

The first sub-elements and said second sub-elements may be bi-directional and provide 
both ingress and egress from the switch fabric. The first sub-elements may comprise said 
25 second sub-elements. 

The first sub-elements and said second sub-elements may be unidirectional and then said 
first sutvelements may provide ingress and said second sub-elements may provide 
egress from the switch arrangement. 

30 

The first and second sub-elements may comprise ports in the switch anangement and 
said first elements comprise aggregations of said first sub-elements. 

The first and second sub-elements may comprise ports in the switch arrangement, and 
35 ttie first elements may comprise aggregations of said first sub-elements and said second 
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elements comprise aggregations of said second sub-elements. 

The switch arrangement may comprise an Input queued cell switch and said service 
requests comprise requests for transmitting a sendee Infonnation rate from one of said 
5 first sub-elements to at least one of said second sub-elements. 

The switch arrangement may comprise an input queued cell switch and said service 
requests comprise requests for transmitting at least one cell from one of said first sut>- 
elements to at least one of said second sub-elements. 

10 

The switch arrangement may comprise an input queued packet switch and said service 
requests comprise requests for transmitting a service information rate from one of said 
first sut>-elements to at least one of said second sub-elements. 

15 The switch arrangement may comprise an input queued packet switch and said service 
requests comprise requests for transmitting at least one packet from one of said first sub- 
elements to at least one of said second sui>-elements. 

The packets may have a fixed-length and comprise cells and said packet switch may be 
20 an input queued cell switch arranged to switch fixed-length cells, and said service 
requests may comprise requests for transmitting one or more fixed-size cells from one of 
said first sub-elements to one or more of said second sut>-elements. 

The packete may have a fixed-length and comprise cells and said packet switch may be 
25 an input queued cell switch arranged to switch fixed-length cells, and said service 
requests may comprise requests for transmitting a service information rate from one of 
said first sub-elements Li to one or more of said second sub-elements U. 

The switch arrangement may comprise a circuit based switch and said service request 
30 comprises a request for a connection in a circuit-based switch. The switch arrangement 
may comprise a circuit based switch and said service request comprises a request for a 
bandwidth in a circuit-based switch. The switch arrangement may comprise a circuit 
based switch and said service request comprises a request for a service information rate 
in a circuit-based switch. The service information rate may be a bit rate. 
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The circuit based switch arrangement may comprise at least one switch taken from the 
group consisting of: any known time-domain, frequency domain, wavelength domain or 
space domain switching technologies. The circuit-based switch arrangement may 
comprise a combination of said switches. 

5 

The switch arrangement may comprise a network, and said elements may comprise 
aggregations of network terminals or nodes and said sub-elements may comprise network 
terminals or nodes. The switch arrangement may comprise an arrangement of Inter- 
connectable sub-networks, where said elements comprise sub-networks and said sub- 
1 0 elements comprise network terminals or nodes. 

The network may be an optical network. The sub-networks may comprise optical 
networtcs. 

15 The elements may become sub-elements witti respect to elements in a higher layer of 
matching. l\/lultiple layers of matching may be performed in a hierarchy of matching levels. 

A second aspect of the invention provides a method as claimed in any previous claim, 
wherein the method of matching comprises: firstly, aggregating service requests to the 
20 highest level of the matching hierarchy, and secondly, resolving contention for said 
sendee requests at the highest level of the matching hierarchy, and thirdly, resolving 
contention in turn dovm through the matching levels to the lowest level of matching. 

A third aspect of the invention seeks to provide a matching method for a switch 
25 anangement comprising a plurality N of input elements, each Input element comprising a 
plurality (U) of Input sub-elements, and a plurality M of output elements, each output 
element comprising a plurality U of output sub-elements, the matching method comprising 
the following steps: performing a first matching across the switch fabric for each of the 
plurality of N input elements and the ML2 sub-elements by perfomiing the steps of: 
30 summing a number of requests from each of the Li sub-elements of the input element; 
generating a first N x MU request matrix; matching the first request matrix to generate a 
first matrix of accepted requests; and perfomiing a second matching across the switch 
fabric for each of the N input elements by performing the steps of: generating N 
asymmetric second Li x ML2 matrices, for each of the N Input elements; and matching 
35 each of the N asymmetric second matrices to generate N second matrices of accepted 
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requests; and generating a NLi x MU matrix of accepted requeste from the first N x MU 
matrix of accepted requests and tJie N second Li x Mi_2 accepted request matrices. 

Tlie NLi X MLz matrix of requests may be symmetric. Li may be equai to U and N may be 
5 equai to M. Tlie N second Li x MU matrices may be asymmetric or symmetric 

The sub-elements may comprise ports on a switch. The sub-elements may comprise 
nodes or temilnals In an optical nelworit The sub-elements may comprise nodes in an 
optical ring networi<. The sub-elements may comprise terminals In a passive optical 
1 0 networic (whether amplified or not). 

The switch anrangement may comprise a packet switch anangement The packet switch 
arrangement may be capable of switching fixed-length packets. The switch anangement 
may comprise a cell switching anangement The cell switching anangement may be 
1 5 capable of switching packets. 

A fourth aspect of the Invention seeks to provide a switch an-angement, the switch 
arrangement having number N of first elements, each first element arranged to at least 
provide ingress to a switch an-angement, each of the first N elements comprising a 
number Li of first sub-elements, the switch anangement having a number MU of second 
sub-elements arranged to at least provide egress from said switch anangement, and 
wherein each of the first Li sub-elements is capable of conveying a sen/ice request for at 
least one of said second sub-elements MU, wherein said sen/lce requests are conveyed 
by performing a matching method which comprises: firstly, for every one of the N first 
elements, aggregating service requests from ail U first sub-elements to each of the MU 
second sub-elements, and secondly, resolving contention for said sendee requests from 
all N first elements to one or more of said second MLa sub-elements, and thirdly, for each 
first element, resolving contention between the U sub-elements and said second MU sub- 
elements. I 

In the fourth aspect, the matching method may be according to any one of the first, 
second or third aspects. 

A fifth aspect of the invention seeks to provide a networic including a switch arrangement 
35 according to the fourth aspect 
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A sixth aspect of the Invention seeks to provide a suite of at least one computer programs 
arranged when executed to implement steps in a method according to the first, sdcond or 
third aspects. At least one program may be arranged to be implemented by software 
5 running on a suitable computational device. At least one program may be arranged to be 
Implemented by suitably configured hardware. 

A sixth aspect of the invention seeks to provide a scheduler for a switching arrangement, 
the scheduler arranged to perform a scheduling process, the scheduling process 
10 comprising: a matching method according to any one of the first, second or third aspects; 
€uid a time-slot assignment process. 

A seventh aspect of the invention seeks to provide a matching method according to any 
one of the first, second or Uiird aspects wherein the sub-elements comprise ports, and the 
1 5 matching updates tfie pointers to input ports according to tiie following mie: Pout = 1 + l(LN 

- Pm + k)inoduJ and the output ports are updated according to the following rule: pin = 1 + 
I(LN-Poui + k)„«iJ 

An eightti aspect of the invention seeks to provide a matching method according to any 
20 one of the first, second or third aspects wherein the sub-elements comprise ports, and the 
matching updates tiie pointers to input ports according to the following rule: Pom = 1 + [(LN 

- Pin + k)modusi]and ttie output ports are updated according to the following rule: pm = 1 + 
[(m-Pout + k)„odJ 

25 In any matching method aspect of the invention, the method may be arranged to enable a 
multicast matching scheme to be implemented. 

In any matching method aspect of the invention, the ML2 output sut>-elements may be 
grouped first into M groups of U sub-elements, and matching may be performed first at 
30 ttie group level between the N groups of Li input sub-elements and ttie M groups of Lg 
output sub-elements, and then, for each of the N groups of Li input sub-elements, 
between tiie Li individual input sub-elements and the M groups of L2 output sub-elements. 

A ninth aspect of the invention seeks to provide a matching metiiod for a number N of first 
35 elements, each first element arranged to at least provide ingress to a switch arrangement. 
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each of the first N elements comprising a number Li of first sub-elemente, the. switch 
arrangement having a numbo- MLz of second sub-elements an'anged to at least provide 
egress from said switch arrangement, and wherelri each of the first Li sub^lements is 
capable of conveying a sen/ice request for at least one of said second sub-elements MLa. 
5 wherein the ML2 sub-elements are grouped into M aggregations of L2 sub-elements, and 
the method comprises: firstly, for every one of the N first elements, aggregating service 
requests from all Li first sub-elements to each of the M aggregations of U second sub- 
elements, and secondly, resolving contention for said service requests from all N first 
elements to one or more of said M aggregations of U second sub-elements, and thirdly, 
10 for each first element, resolving contention between the. Li sub-elements and said M 
aggregations of U second sub-elements. 

Another aspect of the Invention seeks to provide a matching method for a multi-stage 
switch anangement having a plurality of logically associated Inputs and a plurality of 

15 outputs, wherein the matching method comprises the steps of: for each logical association 
of inputs, aggregating sendee requests from every one of the inputs which form said 
logical association; resolving contention for said aggregated sen/ice requests between all 
of the logical associations to the outputs of the switch arrangement; and repeating the 
above steps in the matching method within each logical association for a sut>set of the 

20 inputs fonming each said logical association until contention is resolved between the 
individual inputs of the switch an^ngement and the outputs of the switeh arrangement. 

Preferably, in each repetition, the number of inputs fomilng the logical association is 
reduced until each logical-association of a sub-set comprises a single input to the switch 
25 arrangement, said aggregated service requests comprise a single service request, 
whereby contention is resolved between each input of ttie switeh an^gement and each 
output of the switch arrangement 

Preferably, each step resolving contention between the outputs of the switoh arrangement 
30 comprises resolving contention between a logical association of inputs and a logical 
association of outputs having the same number of inputs . 

Preferably, said multi-stage switch an^angement comprises a pluralHy of switohing stages, 
at least one switching stage comprisirig: a plurality of switches which logically associated 
35 into different sets of switches, each set of switches being logically associated with one of 
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said logical associations of inputs of the switch an-angement, wherein each set of logically 
associated switches operate only on the inputs of the switch arrangement with which they 
are logically associated, the switch arrangement further comprising a global spatial 
switching stage arranged to receive traffic derived from any of the inputs of the switch 
5 arrangement via any logically adjacent sets of said switches. 

Preferably, said multi-stage switch arrangement comprises a plurality of switching stages, 
at least one switching stage comprising: a plurality of switches which iogiceJIy associated 
into different sets of switches, each set of switches being logically associated with one of 
10 said logical associations of outputs of the switch arrangement, wherein each set of 
logically associated switches operate only to provide output to the outputs of tiie switch 
arrangement with which they are logically associated. 

Another aspect of the Invention seeks to provide a multi-stage switch arrangement 
15 arranged to switch time-slotted traffic segmente, the switch arrangement comprising:a 
plurality of switching stages including a spatial switching stage arranged, to receive traffic 
which has been switched by at least one switching stage logically adjacent to the input of 
spatial switching stage, the spatial switching stage being further arranged to output to at 
least one switching stage logically adjacent to its output, each of said at least one 
20 switching stage logically adjacent to the input of the spatial switching stage comprises a 
plurality of input aggregation switching stages, each aggregation switching stage being 
logically associated with a subset of the inputs of the switch arrangement, each of said at 
least one switching stage logically adjacent to the output of the spatial switching stage 
comprises a plurality of output aggregation switching stages, each output aggregation 
25 switching stage being logically associated with a subset of the outputs of the switch 
arrangement, the mutii-stage switch being further arranged to implement suitable control 
means to enable the time-slotted traffic to be matched according to the matching method 
according to any method aspect of the invention. 

30 

Advantageously, the invention seeks to provide a scheduling algorithm suitable for a high- 
performance VOQ IQ switch which has a reduced level of complexity yet supports an 
35 acceptable level of throughput. The scheduling algorithm is provided with less 
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computational complexity by perfomnlng the matching over several hierarchical levels 
within and between smaller switches or sut>-networks or aggregations of input and output 
ports, by providing a matching algorithm which operates generally, but not exclusively, on 
an asymmetric request matrix. 

5 

Advantageously, the invention reduces the computing complexity and enables larger 
cell/packet switches/networte to be constructed without distributing the scheduling 
decisions too loosely between the smaller switches or sut>-networks or aggregations of 
input and output ports so that performance is degraded. 

10 

The asymmetric request matrix grants requests between inputs and outputs of differing 
levels of aggregation, e.g., switch-port, node aggregation-node, ring-node, or PON- 
terminal. In general, as more than one hierarchical level of matching is perfonmed 
between different sub-networks; inuitistage buffering and switching can be used to 

15 support this. Advantageously, however, in some embodiments of the invention, the multi- 
stage buffering/switching is implemented by means of multi-hopping. Advantageously in 
such etnbodiments, buffering remains at the switch/networic edge. This means that where 
the invention is implemented in an otherwise optical network environment, the buffering 
can be implemented electronically, avoiding the expense of optical buffering technology. 

20 . 

Advantageously, the invention can provide a global frame-based optimal scheduling 
algorithm which operates both within and between each individual sub-switch/networi<, the 
scheduling algorithm comprising a matching algorithm stage and a channel assignment 
(time-slot assignment) stage. The global frame-based multi-level matching scheme uses 

25 multiple aggregation levels. Channel assignment can be provided by any suitable 
mechanism, for example, one example of a time-slot assignment process is described by 
the inventors in their United Kingdom Patent Application No. GB-A-0322763.4, the 
contents of which are hereby incorporated into the description by reference. In a 
preferred embodiment of the invention the channel assignment stage comprises a method 

30 of buffering the timeslot interchanging stages by multi-hopping (3 hops) between sub-sets 
of the networi< nodes (tenninals) so that buffering can be located at the edge nodes only 
as described by GB-A-0322763.4. 

In contrast to the prior art, the invention may use asymmetric traffic request matrices, 
35 applied to different parts of the overall networic with different levels of aggregation, in order 
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to reduce the matching complexity. For example, the asymmetric request matrix can be 
between input switch-output port or upstream ring-downstream node or upstream PON- 
downstream temninal. This allows sufficient information about individual port, node or 
tennlnal identities to be retained to prevent receiver contentions and source blocking, 
5 while reducing the overall matching complexity. 

The preferred features of the invention (or dependent accompanying claims) can be 
combined in with any of the aspects of the invention (or independent accompanying 
claims) in any appropriate manner apparent to those skilled in the art. The Invention will 
10 now be described with reference to the folloviring drawings which are by way of example 
only and in which: 

Rgure 1 is a sketch of a N ^ N input queued packet switch; 

15 Rgure 2 is a sketch of a 4 x 4 input queued packet switch showing virtual output queues 
VCX3s and its corresponding request matrix; 

Rgure 3a is a sketch showing the input and output elemente and sub-elements of a switch 
arrangement according to an embodiment of the invention; 

20 

Rgures 3b and 3c show a simplified view of the switch in Rgure 3a and an unpopulated 
symmetric LN x LN matrix for the switch shown in Rgure 3a respectively; 

Rgure 4 shows steps in a method according to an embodiment of the invention; 

25 

Rgure 5 is a sketch showing schematically the aggregation of requests for the switch 
shown in Rgure 3a in the first level matching method according to an embodiment of the 
invention; 

30 Figure 6 shows schematically the steps of aggregating requests and the 1^ level of 
matching in multi*level matching scheme according to an embodiment of the invention; 

Rgure 7 shows schematically tfie steps of perfonning multiple, parallel matchings of N 
elements, each having L input ports, including de-aggregation, in the 2^ level of a multi- 
35 level matching scheme according to the invention; and 
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Rgure 8 shows the pointer positions for the 2 x 4 asymmetric request matrix [ri(n,j)] In the 
first cycle or frame (k=0) for a multi-level r^iatchlng scheme according to an embodiment of 
the Invention. 

5 

The best mode of the invention as cunently contemplatecl by the inventors wili now be 
described. This Invention relates to the matching part of a frame-based scheduling 
algorithm. The matching algorithm is able to use multiple levels of aggregation for padcet 
requests. The term pad(et is used here to refer to multi-cast and unicast paci<ets of fixed 
10 length (i.e., fixed as in a cell has a fixed length) or variable length as is apparent to those 
sidlled in the art The switeh arrangements described relate to a number of possible 
embodimente, including packet, ceil, and circuit switching anangements. A cell switoh 
can additionally include means to switoh paclceto of fixed and/or variable length in some 
embodiments of the invention. 

15 

The invention can be used to matoh service requests in any switeh anangement provided 
over a networic. For example a matohing for service rate requests between ports on any 
switch can be provided by the invention, as well as a matching on a larger scale between 
sub-networte within a communications networic For example, the matching process csm 
20 be used wrtien traffic needs to travel between interconnecting optical networi<s and rings. 
As has been mentioned above, the invention can also, in some embodiments, be used to 
match service requests in a circuit switeh environment. 

A specific embodiment of the invention will now be described with reference to Rgures 
25 3a,3b, and 3c of the accompanying drawings. Figure 3a shows schematically a switch 
comprising a number of elements #a1,...,#aN and #b1....#bM between which traffic can be 
switched over switch fabric 31. Each of the elements #a1..#aN has a number of sub- 
elements, and each of the elements #b1..#bM has a number of sub-elements. The 
number of sub-elements Li may not be equal to the number of sub-elements La, and the 
30 number of elements N may not equal the number of elements M in some embodiments of 
the invention. In a preferred embodiment of the invention the product LiN is equal to the 
product L2M, and in the best mode of the invention N = M and Li= Lg. The sub-elements 
may comprise unidirectional ingress or egress to the switch fabric, or they each may 
comprise bi-directional ingress and egress facilities to and from the switch fabric. 

35 
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In one embodiment of the invention, the elements N, M comprise sub-networlcs in a 
network connected by a hub switch fabric, and each sub-element, comprises a node or 
terminal on each sut>-network'^ #a1...#aN or sub-network #b1...#bM. For e^mple, 
consider an embodiment where a switch is arranged to switch traffic moving between 
5 different rings and/or networks and needs to be capable of switching traffic at different 
levels of aggregation, for example between optical networks (particularly passive optical 
networks PONs). Such a switch needs to have high perfonnance and support fast 
sv\ntching speeds in a reliable and fair manner, as discussed by Bianco et ai in their paper 
on Access Control Protocols for Interconnected WDM Rings in the DAVID Metro Network, 
10 IWDC 2001 (International Workshop on Digital Communications), Taormina, Italy, 
September 17-20, 2001 the contents of which are hereby incorporated by reference. 

It will be appreciated by those skilled in the art that whilst Figure 3 is described with 
reference to elements and sut>-elements, there is a clear analogy to embodiments in 
15 which the elements comprise, for example, different terminals or ports of a switch, or 
different temiinals of a network, or different tenminals of a sub-network, or a sub-network 
having a number of different terminals, or a switch having a number of temiinals or ports. 

i 

In the prior art, it is known to perform the matching between the input and output ports or 
20 tenminals of a switch or network conventionally in one operation or hierarchical level. The 
preferred embodiment of the invention proposes a matching scheme for a switching 
arrangement comprising a number of input sub-elements (for example ports or terminals) 
which are grouped into elements and the matching is performed in more than one 
hierarchical level in a global, end-to-end manner. 

25 

The invention seeks to Increase the amount of parallel processing that can be performed 
in the matching and reduce the computing steps required for the matching. The elements 
can, in some embodiments of the invention, be arbitrary sub-sets of the sub-elements, for 
example, the sub-elements could comprise ports (terminals) without any particular 

30 physical significance or in alternative embodiments comprise ports on real sub-networks. 
For example, if the switch input and output elements comprise rings in an interconnecting 
switching arrangement of rings, then the input and output sub-elements could comprise 
the individual nodes or terminals. AKematively, if the switching arrangement is a large 
switch comprising a plurality of interconnected smaller switches, then the sub-elements 

35 could comprise the ports on smaller switches. 
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This Invention therefore provides a global matching algorithm for use In either single-stage 
or multi-stage switching and buffering networks, without resorting to complete autonomy 
of the smaller elements (I.e. sub-sets of ports or terminals, switches or sub-networks), nor 
5 aggregating requests at too high a level (e.g. ring-ring or PON-PON). The inventfon 
provides a matching method In which the sub-elements (e.g., the porte or terminals) are 
grouped into elements and the matching Is perfonned In more than one hierarchical level, 
and in a global, end-to-end manner. This has the benefit of increasing the amount of 
parallel processing that can be perfonned in the matching and reducing the computing 
1 0 steps required for the rifiatching. 

As described with reference to the prior art, matohing algorithms conventionally make use 
of symmetric traffic request matrices between ceil- or packet-switch ports, or between 
rings or PONs In packet networt(s, or between nodes or tenminals in packet networits. But 

15 this invention employs traffic request matrices which are in general {tnA not exclusively) 
asymmetric, and which are applied to different parts of the overall networic with different 
levels of aggregation, in order to reduce the matching complexity. For example, at a firat 
level of aggregation the matching may be between input elements and output sub- 
elements,.and a second level of matching may be between input sub-elements and output 

20 sub-elemente. In this way, an asymmetric request matrix can be generated for service 
requests between an Input switch element and an output port of an output switch element. 
Altematively, an a^metric matrix could be generated between an upstream ring 
element and a downstream node sub-element or alternatively, an upstream PON element 
and a downstream terminal sub-element 

25 

By providing a two-level, global (i.e., end to end) matching process between the elements 
and sub-elements and sub-elements to sub-elements, sufftelent information about 
individual port, node or temriinal identities can be retained to prevent receiver contentions 
and source blocking, while reducing the overall matching complexity. 

30 

In some embodiments of the invention, more than one level of aggregation can be 
implemented In the matching method, and as such elements become suti-eiements with 
respect to elements In a higher layer of matching. For example. It Is possible for multiple 
layers of matching to be perfonned In a hierarchy of matching levels in some 
35 embodiments of the invention. As an example, one method of matching accoiding to an 
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embodiment of the invention comprises the following steps: 

firstly, aggregating service requests to the highest level of the matching 
hierarchy! and 

secondly, resolving contention for said service requests at the highest level of 
5 tiie matching hierarchy, and 

thirdly, resolving contention in turn down through the matching levels to the 
lowest level of matching. 

In Rgure 3a, a switch 31 is shown schematically to be arranged to switch traffic between a 
10 number of elements N (denoted #a1...#aN), each having Li sub-elements across a 
suitable switch fabric 31 (for example a hub) to a number of output sub-elements, here 
MLa in number. Each element a#1..a#N comprises a number of different sub-elemente 
Li, and in Rgure 3a, an embodiment of the invention is shown where the l\4L2 sub- 
elements are shown aggregated into M groups of U sub-elements. The grouping (or 
15 aggregation) of the output sut>-eiements into M elemente, does not ocqur in other 
embodiments of the invention. Switch 31 therefore comprises NLi inputs and MU 
outputs, i.e., switch 31 effectively comprises an NLi x ML^ switch. As an example, 
consider an optical embodiment of the invention where aggregation of both inputs and 
outputs may be present if the N, M elements are PONs or optical ring networks as then 
20 both the Li and L2 sub-elements may comprise user terminals or nodes. 

Figure 3b shows a simplified representation of the switching arrangement showed in 
. Rgure 3a, which illustrates more clearly the sub-elements fonming the inputs and outputs 
of the switch 31. In Rgure 3b, switching arrangement 40 comprises LiN input sui> 

25 elements i (for example ports) and L2M output sub-elements j(for example ports) J. A 
conventional matching algorithm for frame-based scheduling such as that which Bianco et 
al describe employs multiple phases for matching, such as was described referring to the 
prior art. Such a conventional technique produces a match for a symmetric request 
matrix, LiN x L2M (or if N = M and Li - La^L, LNxLN) in size such as is shown in Rgure 

30 3c (where a grid is shown as the matrix as yet unpopulated). 

Asymmetric. Multl-Staae Matchino Usino Multiole Levels of AQareoation 

35 Referring now to Rgure 4 of the accompanying drawings, an overview of the steps in a ^ 
matching method according to the invention is shown suitable for the switch environment 
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shown in Rgures 3a,3b, and 3c of the accompanying drawings. 

Where a conventional input queued switch arrangement is being considered, the term 
sub-element is used to refer to any ports and the temn element then refers to an 
5 aggregation of such ports. Where the switch arrangement is provided by a network 
element interconnecting a number of optical networks (for example, an optical ring 
network, or passive optical networks (PONs), the term sub-element is used to refer to any 
nodes on the rings or terminals on the PONs and the term element refers to an 
aggregation of such nodes or terminals, for example, the term element could refer as such 
10 to a ring network or a PON. 

As has been mentioned, in some embodiments of the Invention, the switch arrangement 
comprises part of a network, and the network comprises Interconnected sub-networks. In 
such embodiments, at one hierarchical level the sut>-networics are the elements, and the 

15 nodes or terrninals in each sub-network comprise the sub-elements, sub-element. For 
example, where the switch arrangement is provided by a network element interconnecting 
a number of optical networks (for example, passive optical networi<s (PONs)), the temn 
sub-element is used to refer to any nodes or terminals on the PONs and the term element 
can refer to the PON. As will be appreciated by those skilled In the art, the hierarchical 

20 matching process according to the invention is not limited to such embodiments, but may 
be implemented in any switching environment where differing levels of aggregation can be 
effected at least for the inputs to the switch arrangement. 

In Rgure 4, an algorithm according to one embodiment of the invention is shown in which 
25 NLi Input sub-elements are capable of generating service requests for ML2 output sub- 
elements over a switch fabric. The NLi input sub-elements are aggregated as N elements 
#a1...#aN, each element comprising Li sub-elements. The ML2 output sub-elements may 
be aggregated into M elements, each element comprising L2 sub-elements in some 
embodiments of the invention, but need not be so aggregated in other embodiments of the 
30 invention. In Figure 4, the U sub-elements are aggregated into M=N elements, 
#b1..#bM. Aggregation for each of the N elements #a1...#aN of the switch arrangement 
30 is initially performed in step 41 by summing the total number of requests for each of the 
Li sub-elements of each of the N elements, i.e., for each element the total number of 
requests destined for each of the ML2 sub-elements is summed over its Li input ports in 
35 step 41 . Each element N can be considered alternatively as a group of sub-elements. 
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A first matching is tlien performed in which the service requests are matched at a first 
aggregation level by generating an asymmetijc N x MU request matrix for each of the N 
Input elements In step 42. The notation used here means that the matrix has N rows and 
5 MU columns, where N Is an integer and MU is an integer. 

A second matching process is then performed in step 43 in wrtiich N separate matchings 
are performed, one for each of the N elements comprising Li sub-elements (input ports). 
This involves N separate Li x UM asymmetric request matrices. De-aggregation is 

10 thereby perfomned back from the aggregate level of the N elements to the MU output sub- 
elements (i.e., output ports) to the aggregate level of Li Input ports to MLg output ports, in 
step 44. It will be appreciated by those skilled in the art that the N matchings of step 43 
could of course be performed sequentially, but it Is advantegeous in temns of the totel time 
taken to mn the algorithm if the number of computing steps (times) can be reduced by 

15 perfonning them simultaneously, in parallel, using multiple matching "processoraT. The 
latter is the preferred approach and is adopted in the t>est mode of the invention currently 
contemplated by ttie inventors. It is also possible for the numt)er of eiemente and sub- 
elemente to differ on each side of the switch as has been mentioned before. 
Anoreaation of Reaueste 

20 

Figure 5 shows schematically how each of the Li sub-elemente of input element #a1 in 
Rgure 3a is initially aggregated into a group of sub-elements. StricAly speaking, because 
the particular matching algorithm being used as an example takes the queue lengths as 
inpute for Ite first normalisation stege, the requests in this step are simply the queue 
25 lengths. (However, the number of requests could be any alternative number of requeste 
for cells/packete to be switched, using other criteria for calculating that number. For 
example, each VOQ request used could be calculated as the queue length limited to a 
ma}dmum of F requeste for the next frame). 

30 Figure 6 shows the aggregation of request step 41 and the first level of matching step 42 
of Rgure 4 in more detail, in Rgure 6, N = ly/l, and Li = U = L for simplicity. 

In Rgure 6, there are N groups of L sub-elements (here a sub-element comprises an 
input port). Each element employs an L x LN asymmetric queue matrix [Q(i,J)]individuai to 
35 represent the numbers of backiogged ceils/packets in each input port of the element 
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destined for each output port of the switch arrangement (I.e., of the switch or switching 
networl( as appropriate). 

Each L X LN matrix [Q(i.j)lindi»i*iai is simply that portion of the gloljal LN x LN [Q(l j)] matrix 
5 for all VOQs of the entire switch or switching network relating to that particular element 
comprising a group of L sub-elements. Thus each element (altematively each group of 
input ports) has all its input port-output port requests recorded. The queue lengths of all 
Input ports In each lQ(i.i)]i«iividuai matrix are then aggregated (summed) into N aggregated 
queue matrices, each aggregated queue matrix having the fomi of a IxLN matrix. The N 
10 aggregated queue matrices are equivalent to a single N x LN aggregated queue matrix 
[Q(n,j)lagg representing the traffic queued In the N groups of Input ports for the output ports 
of the switch anangement. 

1^ Level of Matchino 

15 

The first level of matching Is just one matehing covering the entire switeh or network. It 
takes the N X LN asymmetric, aggregated queue matrix lQ(n j)]aoB as its Input, as shown in 
Rgure 1 . 

20 Output and Input t)ooking using the example matching algorithm are summarised as 
follows. Outputs for the matching still represent the overall output ports of the switoh. but 
inputs represent here the N elements, each element comprising a group of sub-elements 
(i.e. a group of input ports). The matrices now possess an Index representing the 1* or 2™* 
level of matching. 

25 

Rrstly, the aggregated queue matrix undergoes a normalisation stage: 

[Q(n J)]agg => [Qnom.(n.j)]asg> [ri(nj)] 

Then the output booking and input booking phases described herein above are perfomied 
30 in a similar manner: 

Output Booking phase: [ri(nj)] => [gi(n,j)]: ^ gi(n,j) ^ F - [J^ Qnonn(n,J)lag9 

n a 

Input Booking phase: [gi(nj)l => [ai(n,j)]; J] ai(n,j) ^ F 
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Here the gi(i,j) are elements in the first matrix of granted requests and the ai(i,j) are 
elements in the first matrix of accepted grants. 

a"'^ Level of l\/latching 

5 

Figure 7 shows in more detail steps 43 and 44 in which multiple, parallel matchings of N 
elements of input ports , including de-aggregation, are perfonned in the second level of 
matching. In the 2"^ level of multi-level matching the aggregated acceptances in the 
acceptance matrix [ai(n,j)] from the 1^* stage of matching provide the limits for matching 

10 between the overall input and output ports within each of the N groups of input ports, I.e., 
within each of the N elements. The input for the matching within each element or group of 
sub-elements is taken as the original L x LN asymmetric queue matrix [Q(i,j)]individuai- 
Performing the matchings automatically provides de-aggregation back from the level of 
the grouped input ports-output ports to individual input ports-output ports. Normalisation 

15 and output and input booking are then performed in the manner described below. In this 
2"^ level, outputs and inputs of the matching once again represent the overall output and 
input ports of the switch/network. 

Normalisation stage: [Q(i,j)]inciividua! => [Qnomi(iJ)]individuai, [raCij)] 
20 Output Booking phase: [rgCiJ)] => [gaOj)]; X ^^(id) ^ ai(n,j) - [J^ Q norm (i i j)]indMdual 

Here the g2(IJ) are elements In the second matrix of granted requests and the a2(IJ) are 
elements in the second matrix of accepted grants. 

The summation Is taken over only the Input ports within an element, and the value of n in 
25 the summation is obviously the Identity of that element. 

Input Booking phase: [Qzill)] => laaOj)]; 2 ^2(i,j) ^ F 

J 

An example demonstrating a specific embodiment of the Invention - Multi-Level Matching 
30 -Aggregation 

As a simple example of the hierarchical, multi-level matching principle. It is possible to 
partition the overall queue matrix [Q(ij)] for the switch in Eqn.1 Into N=:2 elemente each 
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comprising L=2 input ports (i.e., N= 2 groups of b=2 sub-elements), and to aggregate the 
requests (queue lengths) destined from the same group of input ports to each output port 
into a single 2x4 matrix, i.e. 

Eqn.23 [2 4 8 ij 

[3 6 12 9~1 
12 9 3 ej 

r4 8 1 2l 
[s 1 2 4J 

Multi-Level Matching - the First -Level Matchino Normalisation Stage 



10 



Because the input "ports" for this matrix are the elements (i.e., the groups of input ports), 
the row-sums over the elements are obviously larger, typically, than the column sums over 
output ports (which have not been aggregated). We will define maxval as the maximum 
column-sum or (row-sum/L), the latter being (row-sum/2) in this example, because each 
15 element or group of sub-elements contains L=2 input ports. With this definition, maxval is 
again 15, and every queue length is multiplied by the ratio F/15=4/15 and the integer part 
of the resulting number is taken. Hence the normalised queue matrix becomes 

Eqn. 24 [Qnorm(n,j)]agg =[3201] 

All of these cells or packets are assumed already to be granted by the output ports and 
20 accepted by the elements at the group level for the sub-elements of that element. The 
request matrix presented to the next "no overbooking" stage is the difference between the 
original queue matrix and the normalised queue matrix, i.e. the remaining requests 

Eqn. 25 [ri(n,j)] = [Q(n j)lagg - [Qnorm(nd)]agg 

_ r3 6 12 9] [0 1 3 21 
" [12 9 3 6] ' [3 2 0 ij 

^[9735] 

Multi-Level Matching - the First-Level Matchino "No Overbooking" S taoe - Output Booking 
Phase 



30 The number of requests in effect already granted by the output ports in the normalisation 
stage is 
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Eqn.26 Qnonn(n j)]agg = [3 3 3 3] 

n 

These are precisely the same numbers as were granted by the normalisation stage in 
conventional single-level matching (Eqn.4). 

5 Once again, the remaining number of grants available in each output port is therefore 
Eqn.27 [f f f f]-[3 3 3 3l=[i i i i] 

Step 3 of the "no overbooking" algorithm described by Bianco et al applies again. But the 
NOB25 pointer up-date rule needs to be modified for the asymmetric request matrix 
10 [ri(n,j)], to ensure that input ports and output ports point to each other, even though there 
are different numbers of each. (NB the N input ports are synonymous at this 1^ matching 
level with the N groups of overall input ports). Input and output ports for this 1^-level 
matching now require slightly different relationships between ports, i.e. 

15 Multi-Level Matching - the First-Level Matchino Pointer Uo-Date Rule for Asvmmetric 
Request Matrix: 

Eqn. 28 for input ports: Pout = 1 + [(LN - Pm + k)mpd ml 

for output ports: p^, = 1 + [(LN - Pout + k)mod J 

20 

In our example witti L=2 and N=2, this becomes 
Eqn. 29 Pom = 1 + [(4 - Pm + k)mod4l 

Pin =1 +[(4-Pout + k)mod2l 

25 

Figure 8 summarises the pointer positions for the 2x4 asymmetric request matrix [ri(nj)] in 
the first cycle or frame (k=0). In any cycle (frame) k, two input ports point to two output 
ports, each of which output ports points back to the same input port that points to it. The 
remaining two output ports alsd point to the two input ports. Hence each input port is 
30 pointed to by two output ports, but only two of the four output ports are pointed to by input 
ports. After 4 cycles or frames, each input port has pointed to each output port once in 
turn, and each output port has pointed to each input port twice. 

Hence In the first frame k=0, with b=2 and N=2 in this example, output port 1 points to 
35 input port 2, 2 points to 1 , 3 points to 2 and 4 points to 1 , i.e. the pointers point to requests 
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r{2,1), r(1,2), r(2,3) and r(1,4) in Eqn. 13. All of these matrix elements have more than one 
request, and because Eqn.27 allows only 1 more available grant for each output port, 
each of these four matrix elements will be granted one more request, i.e. 

Eqn. 30 additional output booking grants, [gi(n,j)] = [^J J ^ J j 

5 

Multi-Level Matching ~ the Rrst-Level Matching "No Overbooking" Stage - Input Booking 
Phase 

From Eqn.24, the number of requests in effect already accepted by the input ports (really 
10 the 2 groups of input ports) in the nonmalisation phase is 



Eqn. 31 QnoUnjWagg = 

Because each sub-element group contains 2 overall input ports, the remaining number of 
acceptances available in each sub-element group is therefore 

Eqn. 32 [^^] - [£ Qr^mU, = [«] - [^] = 

15 The matrix to be used in this input booking phase is the additional output booking grants 
matrix [gi(n,j)] (Eqn.30). Step 2 of the "no overbooking" algorithm applies- All of the 
additional grants are therefore accepted, so the additional acceptance matrix is 

Eqn. 33 additional input booking acceptances, [aiadditionai(n,j)] = [i q i o] 
The final 1® -level acceptance matrix becomes 

20 Eqn. 34 [ai(n,j)] = [Qnomi(n,j)]agg + [aiacldltlor«^^^ 2 0 l] [l 0 1 o] " [4 2 1 l] 

Note that all output ports fill all F»4 time slots and all input ports (sub-element groups) fill 
all 2F=8 time slots, in this first frame. A full set of 16 cells or packets are accepted. 

25 Multi-Level Matching - the Second Level Matching for Sub-element Group 1 

From Eqn.23 the queue matrix for the first sub-element group (i.e. the aggregation of 
input ports comprising the first element) is 

Eqn. 35 [Q(i j)lindividuai =[2431] 

30 and from Eqn34 the number of acceptances to each output port from the 1^ level of 
matching for this aggregation are 
Eqn. 36 [ai(1j)l= [0233] 
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These acceptances represent the maximum number of grants that will be allowed by each 
output port to all input ports in this 2""* level of matching. 

Multi-Level Matching - the Second Level Matching for Sub-element Grouo 1 
5 Normalisation Stage 

The maximum row-sum or column-sum, maxval, in Eqn.35 is 15. The normalised queue 
matrix [Qnorm(i j)]individuai is obtained from Eqn.35 by multiplying each element by the factor 
F/maxval = 4/15 and taking the integer part of the resulting number, i.e. 

10 Eqn. 37 [Qnonn0.j)lincfivWual = j^^ 1 2 o] 

All of these cells or packets are assumed already to be granted by the output ports and 
accepted by the Input ports. The request matrix presented to the next "no overbooking" 
stage is the difference between the original queue matrix and the nomialised queue 
matrix, i.e. the remaining requests 

15 

Eqn. 38 [raOJ)] = [Q(i,j)]indlvidual - [Qnonn(iJ)lincfivldual= [2 4 g l] " [o 1 2 o] " [2 3 6 1^ 

Multi-Level Matching - the Second Level Matching for Sub-eleme nt Group 1 "No 
Overijooking" Stage - Output Booking Phase 

20 The number of requests in effect already granted by the output ports in the nomialisation 
stage is 

Eqn. 39 QnoimOJWindlvldual = [o 1 3 2] 

i 

The number of additional grants allowed is 
Eqn. 40 

25 [ai(1,i)]-[2; Qnonn(ij)lindivIdual= [0 2 3 3] - [O 1 3 2] = [O 1 0 l] 
t 

Step 3 of the "no overtDOoking" algorithm applies. The pointer positions in the first cycle or 
frame (k=0) are as in Figure 8. Output port 2 points to input port 1 and output port 4 also 
points to input port 1. Both additional grants are made, i.e. 

30 Eqn. 41 additional output booking grants, [g2(i j)] = [J ^ o o] 
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Multi-Level Matching - the Second Level Matching for Sub-element Group 1 "No 
Overbooking" Stage - Input Bool^ina Phase 

The number of requests in effect already accepted by the input ports in the normalisation 
5 stage is 

Eqn. 42 [J] Qnorm(ifj)]incfividuat = 

J 

The number of additional acceptances allowed is 

Eqn. 43 [^] ' ^2 Qnorm(i j)]indlvidual " 4 " [3] [l] 

The request matrix for this input booking phase is the additional output booking grants 
10 matrix [g^iUl)] (Eqn.41). Step 3 of the "no overbooking" algorithm applies. In the first cycle 
or frame input port 1 points to output port 4 (Rgure 8), so the additional input booking 
acceptance matrix becomes 

Eqn. 44 [azacwifonaiO j)l - [q o o o] 

The final acceptance matrix is the sum of the acceptances from the initial normalisation 
15 (Eqn.37) plus these additional acceptances from the "no overbooking" algorithm (Eqn.44), 
i.e. 

Eqn. 45 [a2(iJ)]M^ = [Qnonm(i »j)]lndividuaJ+[a2addmonal(i i j)l 

_ ["0 0 1 2] To 0 0 ll 
" [o 1 2 oj [o 0 0 oj 
_ ro 0 1 3] 
1 2 oj 

20 

Multi-Level Matching - the Second Level Matching for Sub-element Group 2 

From Eqn.23 the queue matrix for the second sub-element group (i.e. the second group of 
25 sub-elements) is 

Eqn. 46 [Q(ij)]individuai =[3124] 

and from Eqn.34 the number of acceptances to each output port from the 1^ level of 
matching are 

Eqn. 47 [ai(2,j)] =[4211] 

30 

These acceptances represent the maximum number of grants that will be allowed by each 
output port to all input ports in this 2"** level of matching. 
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Multi-Level Matchinn - the Second Leve l Matching for Sub-element Group 2 
IMormaiisation Stage 

The maximum row-sum or column-sum, maxval, in Eqn.46 Is 15. The nonnalised queue 
5 matrix [Qnorm{ij)lndividuai is obtained from Eqn.46 by multiplying each element by the factor 
F/maxval = 4/15 and taking the integer part of the resulting number, i.e. 

Eqn. 48 [QnomiO J)]individiial = j^^ q q jj 

Ail of these cells or packets are assumed already to be granted by the output ports and 
accepted by the Input ports. The request matrix presented to the next "no overbooking" 
10 stage is the difference between the original queue matrix and ttie nomalised queue 
matrix, i.e. the remaining requests 

Eqn. 49 [rzOJ)] = [Q(i>])]iR(IMdual - [Qnonii(i>j)]indivkliial 

["4 8 1 2"] fl 2 0 O"! 
~ [s 1 2 4J ' [2 0 0 ij 

Multi-Level Matching - the Second Level Matching fo r Sub-element Group 2 "No 
Overbooking" Stage - Output Bookin g Phase 

The number of requests in effect already granted by the output ports in the nonmalisation 
20 stage is 

Eqn. 50 Qnom>(i.j)llndl*lcJual =[3 2 0 1] 

i 

The number of additional grants allowed is 

Eqn. 51 [ai(2,j)] - [J^ Qnom,(i.j)]lndividL.al = [4 2 i l] - [3 2 0 l] = M 1 ol 

25 Step 3 of the "no overiDooking" algorithm applies. The pointer positions in the first cycle or 
frame are as in Figure 8. Output port 1 points to input port 2 and output port 3 also points 
to input port 2. Both additional grants are made, i.e. 

Eqn. 52 additional output booking grants, feaCiJ)] = [j © i o] 



30 Multi-Level Matching - the Second Level Ma tching for Sub-element Group 2 "No 
Overbooking" Stage - input B ooking Phase 
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The number of requests in effect already accepted by the input ports in the nonnalisatlon 
stage is 

Eqn. 53 QnomiOij)lindividual = L 

The number of additional acceptances allowed is 

5 Eqn. 54' - Qm)m,(i,j)]individual = [4] - [3] = [}] 

The request matrix for this input booking phase is the additional output booking grants 
matrix [gziUl)] (Eqn.52). Step 3 of the "no overbooking" algorithm applies. In the first cycle 
or frame input port 2 points to output port 3 (Figure 8), so the additional Input booking 
1 0 acceptance matrix becomes 

Eqn. 55 [a2addiiionai(i j)] =[0010] 

The final acceptance matrix is the sum of the acceptances from the initial normalisation 
(Eqn.48) plus these additional acceptances from the "no overbooking" algorithm (Eqn.55), 
15 I.e. 

Eqn. 56 [aaO j)W = [Q norm(ij)]indivlcIual+[32additional(i»j)l 

_ ri 2 0 ol To 0 0 Ol 

. " [2 0 0 . ij [0 0 1 oj 

_ ri 2 0 o"! 

[2 0 1 ij 



20 

Multi-Level Matching - Overall Acceptance Matrix 

The overall matrix of accepted requests is the concatenation of Eqn.45 and Eqn.56 for the 
two sub-element groups, i.e. 

["0 0 13" 
0 12 0 

25 Eqn. 57 [a2(ij)l= 1200 

2 0 1 i_ 

Note that the 2™* level of matching has been unable to maintain the full set of 16 
acceptances achieved by the 1^ level of matching of tiie sub-element groups. Only 14 cell 
or packet requests are finally accepted. This contrasts with the acceptance matrix 
resulting from conventional, single-level matching in Eqn.22, which achieves a full set of 
30 16 acceptances. The reason for the reduction in accepted requests may be due to tiie 
nature of the modified NOB25 pointer up-date rule suggested for use with asymmetric 
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request matrices in Eqn.28. This is discussed below, and a further modification is 
proposed. 

Multi-Level Matching - Pointer Uo-Date Rules for Asymmetric Reouest l\/latrtces 

* 

5 

For the particular asymmetric request matrices used in the working example, there 
appears to be a problem with the modified NOB25-like pointer up-date rule (Eqn.28). It 
can be shown that there are pairs of output ports that always point to just one, common, 
input port in every frame. (The particular input port for a given pair of output ports changes 

10 cyclically from frame to frame). This can result in requests being granted by two output 
ports to the same input port during the output booking phase, causing overbooking, which 
causes one of these grants to be dropped during the following input booking phase. This 
never happens for symmetric request matrices, because output port pointers never point 
to the same input port; there are enough input ports for all the output port pointers to point 

15 to different input ports. The problem has been found to happen when only two of the 
output ports are able to grant additional requests, and the two ports happen to be a pair 
that point to the same input port. 

One embodiment of the invention provides a possible solution to this problem by deciding 
20 the pointer positions after the number m of output ports that are .allowed to make 

additional grants is known. This would no longer be a detenministic pointer up-date rule. 

However, the nature of the no overbooking algorithm described above ("NOB25") could be 

preserved to some extent by adapting the rule to the variable number of output ports m. 

Once this number m Is known for a matching in a frame, these m ports would be ranked in 
25 order and the output port pointer positions would be calculated as follows 

Eqn. 58 for output ports: Pm = 1 + [(m - Pout + k)mod J 

where Pout is now the rank of the output port allowed to make additional grants, not the 
30 output port's identity. If the same sub-set of output ports are allowed to make additional 
grants recurrently in each frame, then the overall effect may be like NOB25, in that the 
pointers would step around these ports by one port in each frame. In the worked example, 
if the same two output ports were to recur in every frame, they would always point to 
different input ports, which should remove the output overtDOoklng. 

35 
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Because there are always more output ports than input ports in the asymmetric request 
matrices, so that no two input ports ever need to point to the same output port, there is no 
need to adapt the NOB25 rule for the input portsf For these ports the pointer up-date rule 
can remain deterministic. The input port pointer positions would be calculated as in 
5 Eqn.28, i.e. 



Eqn. 59 



for input ports: Pout = 1 + [(LN - Pin + k)mod um] 



10 



where Pin Is the input porf s identity. 



Using this rule, it can be shown that the overall acceptance matrix for the worked example 
using multi-level matching becomes 

0 0 13' 

Eqn. 60 Ia20j)l= J 2 ? o 

3 0 0 1 

The matching now accepts a full set of 16 requests In the first frame. They are not all 
1 5 taken from the longest VOQs. For example, a2(3,3) is one of the shortest queues with only 
one request. 



Those skilled in the art will appreciate that the number of hierarchical matching levels 
depends on the number of levels of aggregation used: In the examples discussed with 
20 reference to Figures 3 to 8 of the accompanying drawings, only two levels of aggregation 
were used. For larger switch arrangements it is possible to have more than two levels of 
aggregation and the number of matching levels increases accordingly. 

The invention can be applied to switching arrangements having bi-directional 
25 elements/sub-elements as is apparent to those skilled in the art. The invention can be 
implemented in any suitable form, including as a suite of one or more computer programs 
which may be implemented using software and/or hardware and the matching algorithm 
may be provided in a form which is distributed amongst several components. 



30 The matching process can thus be implemented by one or more hardware and/or software 
components arranged to provide suitable means. For example, to implement the 
matching process on requests for sen/ice which are queued at the input of the input 
queued switch, the hardware and/or software component implementing the invention may 
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include arbiters of parallel or serial operation. 

Those skilled in the art will also realise (hat where reference has been made to the switch 
arrangement having NL Inputs and NL outputs, this can be generalised to the case where 
5 a switch arrangement has NLiinputs and ML2 outputs, and that specific features of such 
embodiments are not limited to the specified number of inputs and outputs which have 
been described here for simplicity. 

In the embodiment of the invention described above, the multi-level matching technique 
10 first matches N input elements to ML2 output sub-elements, at the highest level of the 
matching hierarchy, then matches the Li input sub-elements within each Input element to 
the ML2 output sub-elements. As will be apparent to those skilled in the art, more than two 
hierarchical levels can be implemented by this invention, but the embodiment described 
above employs a two level hierarchy for simplicity. This can provide a better matching for 
15 request matrices than is possible using other scheduling algorithms that perform the 
matching with a greater degree of aggregation of output ports, nodes or terminals, such as 
ring-to-ring or PON-to-PON. 

Those skilled in the art will recognise that in the context of the invention, the term element 
20 refers collectively to a group of sub-elements. The above embodiments of the invention 
have described various examples where firstly all the input elements of the switch 
arrangement are matched to the output sub-elements of the switch arrangement (i.e., 
groups of sub-elements are matched to output sub-elements of the switch arrangement) 
and then the individual sub-elements in each group are matched to the output sub- 
25 elements of the switch arrangement However, it is possible to group the sub-elements of 
the outputs, and outlet grouping can be combined with input grouping. 

OUTLET GROUPING 

30 It is known to have switch (and network) anrangements where groups of output ports all 
have the same destination, e.g. where they all transmit on the same link to the next 
switch. It is known from ATM switching that under these circumstances it does not matter 
to which particular output port of such a group of output ports the individual cells are sent 
by the ATM switch fabric. This Is known as outlet grouping, and it reduces the blocking 

35 probability of output contention, by sharing the group of output ports between all cells 
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destined for the same outgoing link. This approach is also known to be attractive In optical 
packet networks, where a number of wavelength channels within the same outgoing fibre 
link are shared between the cells or packets within the fibre. It does not matter which 
wavelength channel is used by each individual cell or packet. Not only is blocking resulting 
5 from output contention reduced by outlet grouping, but the computing complexity of 
matching and channel assignment (both time-slot and wavelength) may also be reduced, 
due to the aggregation of output ports, (output sub-elements) into groups of output ports 
(output elements). 

10 Although there are potential problems of cell or packet mIs-sequencIng at subsequent 
switches, which may need to be addressed, such outlet grouping can be dealt with very 
easily within the framework of multi-level matching. In this embodiment applied to outlet 
grouping, consider L2 output sub-elements within an output element constituting an outlet 
group, of which there are M. Matching is still perfbmied in multiple levels. For example, 

15 with just two levels of matching, the first (highest) level would be between input elements 
and output elements and the second level of matching would be between input sub- 
elements and output elements. Since both levels match to output elements rather than 
sub-elements, the number of cell or packet requests that can be accepted to each output 
element is obviously L2 times greater than to each individual output sub-element. 

20 

Accordingly, process for resolving contention when scheduling traffic across an input- 
queued switch anrangement is provided by the invention. The process is also capable of 
resolving service contention across a circuit switch arrangement. The process involves a 
method to match seivice requests between a number of input sub-elements and a number 

25 of output sub-elements. The input sub-elements are aggregated into groups whose 
service requests are then matched to either the output sub-elements or to aggregations of 
the output sub-elements. The individual input sub-elements of each aggregation of input 
sub-elements are then matched to the output sub-elements or to the aggregation of output 
sub-elements. This provides a hierarchical, two-level matching process. More generally, 

30 a matching process Is provided for a number N of first elements, each first element 
arranged to at least provide Ingress to a switch arrangement, each of the first N elements 
comprising a number Li of first sub-elements, the switch arrangement having a number 
ML2 of second sub-elements an-anged to at least provide egress from said switch 
arrangement, and wherein each of the first Li sub-elements is capable of conveying a 

35 service request for at least one of said second sub-elements ML2, wherein the process 
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comprises: firstly, for every one of tlie N first elements, aggregating service requests 
from all Li first sub-elements to each of tlie I^L2 second sub-elements or to each of the IVI 
aggregations of L2 second sub-elements, and secondly, resolving contention for said 
service requests from all N first elements to one or more of said second ML2 sub-elements 
5 or of said IVI aggregations of L2 second sub-elements, and thirdly, for each first element, 
resolving contention between the Li sub-elements and said second ML2 sub-elements or 
said M aggregations of L2 second sub-elements. The matching process can be extended 
to any number of hierarchical levels by considering elements in one hierarchical level as 
sub-elements in a higher level. Matching is performed first at the highest level of the 
10 hierarchy, then in turn down through the matching levels to the lowest matching level of 
the hierarchy. 



